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ABSTRACT 



It is the purpose of these two reports to demonstrate with 
very simple systems, several applications of the optimization theory 
of L. S. Pontryagin (1). An attempt to synthesize the control logic 
using a modification of dynamic programming is developed. 

Part 1 presents the solution to the minimum time problem and 
the minimum fuel problem for a second order system with one zero. 
The zero causes discontinuities in the state variables. Control dif- 
ficulties are encountered when there are discontinuities in the error 
states, but these are alleviated by a transformation to a system with 
continuous variables. Optimum control of the transformed system is 
then accomplished using the methods of Pontryagin, The control 
action is then related back to the original plant. Although the in- 
vestigation is concerned entirely with second order systems , the 
methods are sufficiaitly general to be extended to higher order 
systems with zeros. In part II an example of the extension is given. 
Also Part II presents various schemes for the synthesis of optimum 



controls . 
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1. Introduction. 



When controlling the performance of a system, it is often desir- 
able to choose the control that will minimize errors in the system and 
do it in the shortest possible time. A practical matter that must be 
considered in the optimization in relation to rapid action is the fact 
that control is of a bounded nature. In a great many important cases, 
the constraint on the magnitude of the control effort precludes the use 
of classical variational techniques to design the controller. 

In 1956 Pontryagin hypothesized his "maximum priciple" which 
has since been proven a necessary condition for the optimization of 
linear systems in relation to rapid action /l/. In solving the mini- 
mum time problem for linear systems with bounded control, the 
principle leads to a "bang-bang" form of control law. This implies 
that the control effort is always being applied at its maximum value. 
There remains, however, the task of finding the optimum time to 
switch the control. Pontryagin' s method leads to a rule for switching 
the controller which is a function of the initial conditions in the 
system adjoint to the one being controlled. Generally these initial 
conditions are difficult to find. 

It is usually helpful to consider the control problem using state 

th 

space techniques. The coordinates of the space for an n order 
system here are a displacement error and its n-1 time derivatives. 
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The space may be divided into two regions each of which is character- 
ized by the control optimal for the trajectories in that region. Optimum 
switching between the two conditions of the bang-bang control occur 
on the hypersurface dividing the space. The switching criteria can 
then be stated as a function of the state space variables. 

Of considerable value in finding the switching surface is the sys- 
tem adjoint to the system. The adjoint can be thought of as the system 
running in reverse time. By plotting trajectories from the origin of the 
error phase space "backwards" in time, with the control satisfying the 
respective adjoint variables, a surface is generated which may be re- 
lated to the optimal switching surface in the system state space. 

A problem of interest occurs when the system is of such a nature 
that when control is applied, a discontinuity appears in one or more 
of the system states. This may happen when the control is of a bang- 
bang form and the forward transmission path of the system contains 
zeros. It could also show up if the control is of such a form that it 
approximates an impulse to the system. When there are discontinui- 
ties in the state space due to switching it is generally no longer 
possible to write the switching criteria as a function of the state 
space variables. 

One alternative might be to switch the control as a function of 
time. This may be done effectively when the number of switchings 
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to reach the origin of the error state space is no more than n-1 in an 
th. 

n order system. Such a restriction limits one mainly to considering 
only those systems with real, distinct eigenvalues. Large disturbances 
in lightly damped (oscillatory) systems may require more than n-1 
switchings to zero the error states. The most important consideration 
when controlling as a function of time is the means of implementing 
the switching logic. To accomplish time dependent control, it is vir- 
tually mandatory that a digital computer be inserted in the control loop. 

Another approach to the problem is to find a system that reacts 
identically to the system with zeros except at the points of disconti- 
nuity. Control of this parallel system can be stated in terms of the 
state space variables. This logic can then be used to switch the 
original plant. 

This paper will be an investigation into the latter method. The 
problem is as follows: 

Given a second order oscillatory system with one zero, 
find the optimum control for zeroing the errors in the 
system in minimum time and for zeroing the errors with 
minimum fuel. 

The method of Pontryagin is used to solve the problem. The brief 
description of the method presented here is based on the work of 
Rozonoer /l/. 



3 



2 . Pontryagin’s maximum principle . 



Given the system state variables described by n first order 
differential equations 

= f^ (x/ii«t) i = 1 ....... ,n (1) 

where x is a column vector in phase space and jj is a column con- 
trol vector consisting of r control elements. 

The control ja(t) must belong to a closed subset U of admissi- 
ble controls and must be piecewise continuous. The trajectory xW 
in the phase space is uniquely determined by (1) when control jj(t) 
and the initial conditions 




x(0) = x° = . 




( 2 ) 



are given. 

The control ^(t) of a system may be considered optimum under a 
variety of criteria. A large class of optimization problems may be 
solved by presenting the criteria in such a way that the solution is 
attained by minimizing a linear function of the final value of the state 
space variables. A control must be selected from U that will trans- 
fer the system (1) from x° to some fixed closed set G of the phase 
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space such that 



n+1 

S = S c.x.(T) (3) 

^11 

is a minimum. The constants c, and the x , . coordinate are chosen 

1 n+1 

such that minimizing (3) optimizes the system. 

In a great many cases optimization of only one of the coordinates 
of the system is desired. For example, in order to optimize the mag- 
nitude of .j, 

f F(x(t), u(t)) dt (4) 

*'0 

for T and x(T) either fixed or free in a system (1) for u(t)eU, a 
new variable is introduced: 

^n+ 1 ^ Jo ^ 



and another differential equation 

= F(x(t), u(t)) 

is added to (1). The problem of optimizing the integral leads to 
optimizing x^^^(T) at t=T. 

Minimizing x^^j^(T) in the system (1) with adjoined is 

accomplished by putting the problem in functional form (3) and apply- 
ing the maximum principle to gain the solution. That is 

n+1 

S = L c.x^(T) = Xj^^j(T) (6) 
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I 



is the functional to be minimized. Here it may be seen that 

. . . , c =0 and c . = 1 . 

n n+1 

A new dependent variable £(t) is now formed such tljat 



The function 



n+1 3f (x j _u, t) 

p.(t) = -L P^ — 

1 I s bx. 



n+1 

H = L p f (x, u, t) 
, s s^ — 



i = 1 ,n+l (7) 



( 8 ) 



is introduced from which equations (1) and (7) may now be written 



bH 

X. = -r 

1 5p. 



SH 

^i Bx. 



i = 1 ,n+l (9) 

i 1 

The control u*(t) is said to satisfy the maximum condition if 

H(x*(t) ,£*(t) ,u*(t)) reaches an absolute maximum at each time t 

(0 ^ t ^ T) where x* (t) and 2 .*(t) are the values of the variables at 

time t with u*(t) € U controlling. For linear systems of the type 

discussed in this paper, the necessary and sufficient condition for 
n+1 

minimizing S = Z c.x,(T) optimally with admissible control is that 
1 ^ ^ 

the control satisfy the maximum condition. 

To use the maximum principle, H is formed and maximized with 
respect to Ji(t). This produces a 

u*(t) = 0 (x, £) (10) 

which may be used with eqviations (9) and the boundary conditions 
to find ii*(x). If the end point of x(t) is not fixed, it becomes 
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necessary to obtain boundary conditions on £(t) in order to arrive at 

a solution. The conditions £(T) may be found using a function 

F(x) ^ 0 which describes G and x^(T) €G, the end point of an opti 

mum trajectory. The form of £(T) will be stated without detailed 

explanation; however, it may be noticed that at time t = T, £(T) is 

n+1 2 

orthogonal to a hyperplane E a.(x. - x. ) = 0 through the endpoint of 

ill 1 

the trajectory and directed toward that portion of G where 
n+1 n+1 ^ 

S c.x. ^ E c.x, (T). The coefficients a, may be expressed as a 
1111 i 

^ 1 

linear combination of the c^ and b^(x (T)), the latter being coeffi- 
cients of a hyperplane through x^(T) bracketing G. 

Thus 

p.(T) = -Xc. -jub.(x^T)) (11) 

where X and fjL are non-negative numbers one of which may be set 
equal to unity as it is only the ratio that is important. 

Generally, three situations arise as to final boundary conditions 

(i) If x^(T) are specified for i = 1,2, ,m then these 

become the boundary conditions for (9). 

(ii) If x^ (T) are internal points of G for i (1 ^ i ^ n+1) 
then b^.(x^(T)) = 0 and p^(T) = -c^. 

(iii) If x^(T) are boundary points of G for some i (1 ^i^n+1) 
then F(x(T)) = 0 and the p^(T) are as in (11). 

When F is differentiable, the bracketing hyperplane through 
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( 12 ) 



has coefficients 



b^(x'(T)) = |^ 



X. = xJ(T) 



If finding the optimum control for minimum transit time another condi- 
tion must be fulfilled since T is not fixed beforehand. This condition 



is that H(T) = 0. 



3 . Development of system equations . 

The eqiiation of a second order system with zeros may be written 
c + 2fcoc + 0>®c = a^u + agU (13) 

where c is the output variable of the system and u is the output of 
a controller. 

This paper is concerned with control of similar systems that are 
purely oscillatory in nature i.e. C = 0. To facilitate ease of com- 
putation in the analysis, equation (13) is scaled to 

c' + c = a^u + u (14) 

which when written in terms of the Laplace transform of the output 
variable becomes 

(a s + l)U(s) 

C(s) = - -3 ^ (15) 

This system is represented in block diagram form in Fig. 1, 
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CONTROL 


u 


ai s + 1 


c 






s^+ 1 






h “ 













Fig. 1 - Block diagram of control system 
The response of the system to a step input is investigated more 
readily by means of the error variable 

e = r - c (16) 

If the input r is fed forward as in Fig. 2, the Laplace transform of 
the error, given 

c(0) = c° 

c(0) = c° 

R(s) = Tq/s (17) 

becomes 

£(g) = (rn - c°)s - (6° + airp) - (ais + l)U(s) 

s® + 1 

Now the problem of zeroing the error states reduces to that of zeroing 
the error initial conditions in the system. 
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Fig. 2 - Controlled system with input fed forward 
Finally with the introduction of state space variables 



ei = -e 

G2 — Gi 



(19) 



the system equations can be written in vector matrix notation /3/ 

( 20 ) 





0 1 




0 


G = 




e + 




— 


-1 0 




aiu + u 



4 . The minimum time problem. 

The problem is stated as follows: 

Given the system (20) and a control force of bounded 
magnitude |u| ^ 1, find the optimum control ii*(t) to 
transfer the state variables from some initial point in 
the phase space to the origin of the phase space in 
minimum time T. 
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That is, given 



e( 0 ) = e° 
e(T) = 0 

|ul ^ 1 

and the system ( 20 ), find u?'(t) such that 
S = adt 

is a minimum where a is a positive constant. 
Introduce 



'n +1 



= ea = S = r adt 



The system equations then become 





"0 


1 


0 " 




0 


^ = 


-1 


0 


0 




ai u + u 




_0 


0 


0 




a 



( 21 ) 



( 22 ) 



(23) 



(24) 



Because of (21) , the functional 

3 

S = E c e.(T) = Cae 3 (T) (25) 

1 ^ 

and since we wish to minimize this, C 3 = 1 is chosen. 03 (T) is not 
limited, hence the boundary condition becomes 

P3(T) = -C3 = -1 (26) 

By ( 8 ), the ham iltonian becomes 



H = Pies - Psei + Ps(aiu + u) + Patt 



(27) 



Since Pa = 



-5H 
d ea 



= 0 it is evident that pa is a constant and there- 



fore Pa = Pa(T) = -1 and now H is 
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I 



H = Pies - Psei + P 2 (aiu + u) - a 



(28) 



which is maximized in u if 

aiii + u= N[sgn Ps] (29) 

where N = max |aiU + u] for each fixed t(0 ^ t ^ T). The control 
u*(t) which satisfies these conditions is a "bang-bang" type control 
where u = ±1 at all times and u at the moment of switching is 
unbounded . 

Since es has served its purpose in the optimization process, 
we may now return to the second order system and solve for the 
"impulse" variables. By (9), 



£ = 



0 

-1 



1 

0 



E 



(3 0) 



and the solution for u*(t) becomes 

u*(t) = 1 • sgn[cos(t + ©) ] (31) 

where © is a phase angle dependent on x°* 

Several properties of the optimum controller are now known. 
First, the control is a bang-bang type which applies maximum effort 
at all times in one of the two "directions". It is switched periodi- 
cally from one state to the other every half cycle until the origin is 
reached. Notice that each time the control is switched, a disconti- 
nuity appears in the es variable. This occurs because u contains 
an impulse. 
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where A eg is the discontinuity in eg at the time of switching t^. 

One would now like to find a switching curve L(e) which 
divides the phase plane ei vs eg in such a manner that control 
u* = +1 is optimum in the space to one side of the curve and u* = -1 
elsewhere. Control would be switched when the trajectory (t) 
crosses the curve. The discontinuity A eg precludes this possi- 
bility. For example, examine the trajectory ^*(t) for some initial 
conditions that dictate u* = -1 for optimum control. At the point 
where this trajectory crosses L(e) the optimum becomes u* =+l. 

The control switches and Aeg = +2ai occurs which places the states 
back in the space where u* = -1 was optimum. Here the control 
switches again, Aeg = -2ai occurs and chatter motion begins. The 
fact that eg is multiple valued at the instant of switching makes a 
simple realization of L(e) impossible. 

For periods between switchings where u = 0, the system is well 

th 

behaved with the solution for the k interval 

ei(t) = K cos (t + 0it) - 6 



(33) 



eg(t) = K cos (t + 01 c + tt/2) 

where 6=1* sgn pg and K,0ic depend on conditions of states at 
th 

the start of the k interval. 



4 . 1 The transformed variable. 



The search for a variable of the system on which to control leads 

to the possibility of "subtracting out" the discontinuity present in 

es at times of switching. 

The Laplace transforms of the system variables are 

T. /_x eis + ei + (ais + 1) U(s) 





Es(s) 



ejs - ei + s(ais + 1) U(s) 
s® + 1 



(34) 



where 



U(s) = 6(-^ 




(35) 



which for any instant of time t (0 ^ t < ti) 
U(s) = f 

Equations (34) then become 

r> /_\ _ e? + (ei + ai6) s + 6 
Ei(s) — ^ 

s (s^ + 1) 

= (ej + ai6) s + (-e? + 6) 
^ s® + 1 



(3 6) 



(37) 



By means of the initial value theorem, it is seen that 



lim ei (t) = lim sEi (s) = e° 

t“* 0 S“* ® 

lim e 2 (t) = lim sEg(s) = eS + ai6 
t-*o s-*® 



(38) 



At time t = 0, es jumps to ei + ai6 . To remove this discontinuity 
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consider the transformed variables 



Yi(s) — Ei(s) 

Ys(s) = Es(s) - ^ 



( 39 ) 



By virtue of (39) and (20) 



sYi(s) = sEi(s) = E 2 (s) = Ys(s) + 
sY.(s) = sE^(s) - ai6 = -Y^(s) + 



(40) 



or 



0 1 



ai 



(41) 



where 6 is a unit step function with sign to be determined. The 
system (41) is identical to that of (20) except for the action at 
time of switching. It should be noted, however, that care must be 
taken in assigning final values to the system described by (41) if 
the two plants are to be controlled in parallel. The final value 
theorem and (39) gives 



From this it is observed that zeroing the final states in (20) is 
analagous to zeroing yi (T) and attaining a final value 



lim ys(t) = lim sYs(s) = lim s(Eg(s) - 
t-»oo s-»o s-*0 




(42) 



ys(T) = -ai6 



(43) 



in the system (41). 
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4 . 2 Boundary conditions and final control. 



From (38) and (39) it is clear that the initial conditions on 
the e and y variables are identical. From (39) it is also seen 



that 



yi(T) = ei(T) 



(44) 



ys(T) = esd) - ai6(T) 

At this point in the pursuit of the optimum control, it becomes neces- 
sary to investigate the system action possible at time t = T under 
admissible control. Aea of (32) provides a means of changing the 
value of eg instantaneously by an amount dictated by the constraints 
on u(t). With this in mind, it is noted that appropriate use of Ae 2 
within the bounds of allowable control may zero the eg variable in 
zero time given that es(T) is within range. The conditions (21) 
and (44) with (32) indicate that for 



|ys(T)|^ai (45) 

the system (20) may be zeroed instantly^. The boundary conditions 
on (41) then become 

y.(0) = yj = i = 1 ,2 

yi(T) = 0 (46) 

|y 2 (T) I ^ ai 



The conditions are stated in terms of the y variable for con- 
venience in order that notational problems arising from multiple 
value of 02 ( 0 ) be avoided. 
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The final controller Us(T) that must zero the errors for t > T 



has two conditions imposed upon it i.e. , 



aiU2 + U2 — 0 
U2(T) - 6(T) = 
The solution to (47) is 



-62 (T) 

6l 



(47) 



U 2 (t) = -- y -— ■■ exp ( ) t^T (48) 

ai ai 

It is assumed then that U 2 (t) is available at time t = T so that the 
boundary conditions on the system are as stated in (46). 



4 . 3 Switching functions . 

The method of finding a function L(y) with which to describe 
the switching criteria for the optimum trajectory proceeds as follows. 
As in the discontinuous case, it is desired that 

S = adt (49) 

be minimized, therefore, another variable ya = S = C 3 ya(T) is 
adjoined to the system and once again Ci = C 2 =0. The hamiltonian 
becomes 

H = Piy2 + Piai6 - P2Yi + P36 - Oi (50) 

This is maximized in 6 when 

6=1- sgn (aiPi + P 2 ) (51) 

With this control, trajectories are circular about (6, -ai6) with 
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radius determined by (See Fig. 3) 




6 = +l 

> direction of 

positive time 



Fig. 3 Yi vs Ys phase plane with trajectories for 6 = ± 1 
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Previous arguments have determined that the conditions on the 



system are 



yj(o) 

yi(T) 

lysd) 

H(T) 

Pa(T) 



= 

= 0 
^ ai 
= 0 
= -1 



i = 1,2,3 



(52) 



The function F = ^ (y| - af) ^ 0 may be used to describe G. 



From this 



b2(yi(T)) = 



SF 

9y2 



ys 



= ys 



= ys 



(53) 



and 



P2 (T) = -X C2 - Mt»2 (y2 (T) ) = -MY2 (54) 

where n ^ 0 with modulus such that F(T) = 0. In the phase plane 
of Pi vs P 2 it is sufficient to note that for trajectories terminating 
at y 2 (T) = -ai, P 2 (T) ^ 0 and for trajectories ending at y 2 (T) = +ai, 
P 2 (T) ^ 0. This information in addition to the control (51) com- 
pletely define L(y) for trajectories ending on the extremes of the 
line segment |y 2 (T) I ^ ai. 

Fig. 4 depicts representative action for optimum trajectories 
terminating at y 2 (T) = -ai, yi(T) = 0. Trajectories ending at 
y 2 (T)=+ai, yi(T) = 0 are mirror images. The optimum switching 
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curves are generated by picking arbitrary values of £(T) from the 
admissible set for the corresponding boundary values of y_(t) and 
working backwards in time plotting the switching points determined 
from £(-t) on the yi vs Yz phase plane. 
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91 



Fig. 4 - Concurrent action of £(t) and x(t) for y (T) 



For trajectories ending in the interior of the line segment where 
lys(T) 1 ai / bs (yg (T) ) = 0 and, therefore, Ps(T) = 0. This com- 
pletes the information necessary to describe L(x). Fig. 5 shows a 
representative trajectory arrived at by translating switching criteria 
from the £ - plane to the x “ plane. Fig. 6 portrays the curve with 
all dimensions. 




switching curve 



Fig. 5 Pi vs pg and yi vs yg phase planes with complete switching 
curves 
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Fig. 7 - Optimum trajectories of ^(t) and ^.(t) (minimum time ) 
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Fig. 8 - Optimum trajectories of ^(t) and y^(t) (minimum time) 



I 



23b 










Fig. 9 - Optimum trajectories of ^(t) and 2 £^(t) ( minimum time ) 
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5 . The minimum fuel problem. 



The minimum fuel problem is solved by minimizing the integral 

J = du| + |aiu I) dt (55) 

•*0 

in the system (20) where T is not specified. It appears simpler, 
however, to once again make use of the transformed variable x(t)> 
By minimizing 

T 

J=[ |u|dt (56) 

in the transformed system (41), the desired result can be obtained 
provided 

i) the switchings in the time interval 0 ^ t ^ T are 
kept to a minimum. 

ii) adjustment is made at time t = T when fuel is 
consumed zeroing the error states e.(t) with the 
exponential control U 2 (T). 

After adjoining (56) to the system (41), the hamiltonian 
becomes 2 



H = Pxy2 - P2yi + u(axPx + Ps) - I u | (57) 

Since T is not specified H(t) = 0. With u(t) constrained as 
before, the control that maximizes H with respect to _g^(t) is: 
u* = 1 • sgn (axPx + Ps) kxPx + Ps | >1 

(58) 

u* = 0 |axPx + Ps I <1 
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5 . 1 Initial Conditions . 



Taking the time derivative of H 



dH / , . du d u 

— - (aiP, + Ps) -57 - 



( 59 ) 



^ J J ^ 

it can be seen that - 7 — = 0 if -r- =0. It may also be argued that 

dt dt 

the change in the hamiltonian with time is zero if 



dlul aIuI 
+ Pa = —i — = — ■ — ' 



a.P. + P= = -fr (60) 

Since u(t) is switching between u = 0 and u= ±1 and vice versa ^ 
this means that the hamiltonian remains constant if the control is 
switched at axPi + P 2 = 1 * sgn (Au), (See Fig. 10). 




Fig. 10 Switching criteria in Pi vs Ps phase plane 
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By choosing control u*(t) the hamiltonian remains at its maxi- 
mum value i.e. identically zero from time t = 0^ after initial control 
has been applied until time t = T. This control minimizes the 
integral (56) but does not necessarily minimize total fuel when fuel 
consumed at switchings is added. In order to minimize switchings, 
it appears necessary to choose the degenerate case i.e. u = 0 
until such time as aiPi + ps = 1 * sgn (Au) where Au is the change 
in u(t) when turning the control on. Notice that this choice 
guarantees that H(t) = 0 for all t, 0 ^ t ^ T. With this in mind, 
the problem remains to minimize fuel in the non-degenerate case. 

For this purpose it will be considered that time t = 0 is that time 
when 

aiPi + Ps = 1 • sgn (Au) (61) 

and initial control is applied. 

At t = 0 it may be verified from (61) and because H (0) = 0 

that 

PiyE - p 8 xf = 0 (62) 

5 . 2 Final boundary conditions . 

In order to investigate final value boundary conditions, the 
optimum trajectories terminating such that yi(T-At)>0 are consi- 
dered. Trajectories in the rest of the space are mirror images. As 
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in the minimum time problem, an optimum trajectory terminating at 
y 3 (T) = -ai,yi(T) = 0 is investigated first. The determination that 
Ps(T) ^ 0 as argued in (54) is still valid. This condition on Pa(T) 
along with the fact that H(T) = 0 precludes the possibility of a 
trajectory terminating as above with u(T) = -1. The following cases, 
however, do apply. Consider 

H(T) = -aiPi(T) + u(T) [aiPi(T) + Pa(T)] - |u(T) 1 = 0 (63) 

This condition implies that if u(T) = 0 then Pi(T) = 0 and if 
u(T) =+l then Pi(T) ^ 0 and Pg(T) =+l. Fig. 11 portrays the locus 
of admissible points £.(T) and the switching curves generated by 
these criteria in the yivsya phase plane are as in Fig. 11a. 




I Fig. 11 Admissible points £_(T) for ya(T) = -ai 
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Fig. 11a Switching criteria for ys (T) = -ai (minimum fuel) 
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Optimum trajectories terminating on the line segment yi(T) = 
0 / ly 2 (T) |< ai must be investigated in a fashion similar to that 
used with the minimum time problem. Since a final boundary point 



ya(T) is not fixed, we may substitute a final condition on Pa(T) 
to reach a solution. At this point it becomes necessary to decide 
on the final value functional to be minimized. It is first noted 
that if the final control to the line segment is u(T) = 0, then 
-ai<y 2 (T) <0. (It must be remembered that investigation is of 
trajectories such that yi(T-At)> 0). If u(T) = 0 then also ys(T) = 
ea(T) and in order to minimize the fuel consumed by Ua(T) to zero 
ea after time T then lea(T)| = |ya(T)| must be minimized. 

If the final control is u(T) = -1 (u(T) = +1 is not possible for 
trajectories terminating on this side of the line segment) then 
e 2 (T) = y 2 (T) - ai and, therefore, |ya(T) - ai | must be minimized. 

In both of the above cases, it may be seen that ya(T) must be 
maximized on the line segment in order that fuel consumed by 
Ua(T) to zero the error states be minimized. Therefore, the functional 
to be minimized is 



By prior arguments Pa(T) = -Ca = +1 and PaCT) = Pa(t) = -C 3 = -1. 



3 

S = Sc y (T) = -ya(T) + yg (T) 
1 



(64) 



where 




t 



(65) 
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5. 3 Generating the switching curve segments . 



It is now helpful to look at the hamiltonian under each of the 
above conditions, i.e. , u(T) = 0 and u(T) = -1. In the first case 
u(T) = 0 

H(T) = H(t) = 0 

yi(T) =0 (66) 

Ps(T) = +1 

-ai<y3(T)<0 

and 

H(T) = Pi(T)y2(T) = 0 

which implies that Pi(T) = 0. Fig. 12 shows the switching gener- 
ated by this condition. 




Fig. 12 Switching criteria for u(T) = 0 (minimum fuel) 
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Next is considered the case where 



and 



u(T) = -1 
H(T) = H(t) = 0 
yx(T) = 0 
Ps(T) = +1 



( 67 ) 



H(T) = Pi(T)y 2 (T) - l[axPi(T) + l] - 1 = 0 



from which 



Pi(T) = 



2 

y2(T) - ai 



( 68 ) 



Since u(T) = -1 and P 3 (T) =+l, conditions (58) are met only when 
-2 

Px(T) < which implies y 3 (T) >0. In Fig. 13 these trajectories 

ax 

and switching curves are plotted. 
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5 . 4 The complete switching curve . 



Because T was never specified and because the fuel consumed 
at switching was handled as a side condition, a composite of all the 
calculated switching curves indicates areas in the phase plane where 
criteria for optimum control appear contradictory. In these areas, 
analysis by graphical means or actual computation will clear up the 
situation. Fig. 14 depicts the composite of the first two criteria 
analized. 




Fig. 13a Switching criteria where u(T) = -1 



31 



In Fig. 14, region A is an area where there is a question con- 
cerning whether it is optimum to switch for [y8(T) 1 = ai or 
lysCT) I < ai. By graphical analysis, it may be seen that it is opti- 
mum to switch so that |y8(T) | = ai. 

A similar contradiction between trajectories switching for 
0 < y8(T) < ai and -ai<y8(T) < 0 may also be resolved graphically.* 
The final result consisting of switching criteria to zero the errors 
in the system (20) with minimum fuel is given by Fig. 15. 




Fig. 14 Region of conflicting optimum criteria 



♦Appendix I presents computational analysis of the resolving 
process . 
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Fig. 15 Switching criteria for minimum fuel, ai=1.0 



33 



6. Realization of control. 



With L(j^ providing the switching logic everywhere except 
where yi(T) = 0 and ly 2 (T)|<ax and with Us available for final 
action, the controller can now be built (theoretically). 




Fig. 16 Block diagram of the controlled plant 
By subtracting ai6 from e 2 (t), the variable yg(t) is gener- 
ated for use in control logic (Fig. 16). At time t = T, Sw 2 
designates ug as the control and errors are instantaneously zeroed. 
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The complexity of the logic necessary to implement the switch- 
ing criteria may lead one to desire a simpler, quasi-optimum control. 
Two such controls are pictured in Fig. 17 for minimum time problem. 
Although detailed investigation of these controls was not carried 
out, it is submitted that both controls are close to optimum especially 
for large initial disturbances in the error space. Both controls were 
designed with the thought that chatter motion would not be tolerated, 
switching function would be linear over a large range, and exponen- 
tial control was available at end point of trajectory. 

It is further suggested that the system might be controlled on 
the error states Instead of the transformed variable if constraints are 
put on time intervals between switchings so that chatter motion is 
avoided . 
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switching curve 



>1 




/ 



> 
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Fig. 17 Quasi-optimum control surfaces (minimum fuel) 



t 




7 . Conclusions. 



The methods used in this paper to arrive at a solution may be 

th 

used to good advantage in the investigation of any n order 
system with no more than n-1 zeros. The maximum priciple pro- 
vides a powerful tool in optimization, particularly for linear systems. 
Often the method of Pontryagin will indicate areas of interest to 
investigate when searching for an optimum control even if the unique 
solution is not readily forthcoming. 

The problem of controlling a plant with zeros is analagous to 
controlling a plant without zeros using an impulse-step type 
controller. Results obtained in this paper can be adapted to formulate 
the logic of this type control. 

The realization of the true optimum switching logic in a practical 
system may in many cases not be worth the effort. Quasi-optimum 
control using simple switching functions that are for the most ffart 
linear is a subject for further investigation. Settling time forthe sys- 
tem is relatively insensitive to limited variations from the optimum 
when trajectories are out beyond the first cusp of the switching 
curve. The two quasi-optimum controls suggested in this paper 
concentrate on avoiding chatter motion. It may be that a control 
using "controlled chatter motion" /2/ would be acceptable in parti- 
cular systems . This type would be particularly attractive for control 
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of trajectories in the area of the first cusp when linear control 



functions are desired. 
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Appendix I 



Graphical analysis of optimum criteria interface 
Given region B of Fig. 16 the problem is to find which 

of the two possible switching criteria is optimum, i.e. , whether 
for trajectories such that yi(T - At)>0 it is optimum to switch for 
0 < ys (T) < ai or -ai < ys (T) < 0 . 




Fig. 16 Region of conflicting switching criteria 
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Consider on curve 1 of Fig. 16 a distance R from the 
origin. Does it take less fuel to apply u(0) = -1 and zero the 
corresponding values of e_ or should the degenerate case be 
chosen until such time as curve 2 is reached (where yi = 0 and 
yk = R) and then switch? The fuel consumed when switching from 
curve 2 is 

ai when control is turned on 

y- if/2 

n/2 for lu|dt 
**0 

ax when control turned off at ys = 0 

|ys (T) I to zero corresponding es (T) 
which when totalled is equal to R + n/2. 

The fuel consumed when switching from curve 1 is 
ax when control is turned on 




l^i ” ys(T) I to zero corresponding e 2 (T) 

The time T which is the time for the Y_{t) to progress from y/ 
at curve 1 to the line segment may be portrayed by use of £(t) 
as in Fig. 17. 
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Fig. 17 Time T as a function of 0 ^ and 03 in £ plane 



Here it is remembered that 
Pi(T) 



72 (T) - ai 
P2(T) = +1 

aiP? + pi = -1 

T = 01 + 02 + TT//2 



(69) 



and that 



H(0) = p?yi - ply? = 0 



which implies 



Pi = 7i 

"pi” "yf 



(70) 



The total fuel from l then is ai + 0i + 0s + ff/2 + |ai - ys(T) 1 and 
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the angles 0i and 02 may be related to points in the yi vs Yz 



space . 



j -1 Px 

01 = tan - — ^ = tan 




( 71 ) 




The point at which equal fuel is consumed when switching on curve 
1 or curve 2 is that point where 



For R > R* optimum switching is on curve 2 . 

Computation of an example in the case where ax = 1.0 revealed; 



yf = -1.355 
yS = 2.140 
yi(T) = 0.0 
ysd) = 0.347 
R* = 2.533 
Fuel = 4.104 



R* — 2ax - y2(T) +01+02 



(72) 
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